Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 5354259 |
| Missing cells | 38229948 |
| Missing cells (%) | 34.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.7 GiB |
| Average record size in memory | 947.8 B |
Variable types
| Numeric | 4 |
|---|---|
| DateTime | 2 |
| Text | 3 |
| Categorical | 12 |
PERSON_TYPE is highly imbalanced (86.1%) | Imbalance |
PERSON_INJURY is highly imbalanced (66.0%) | Imbalance |
EJECTION is highly imbalanced (93.4%) | Imbalance |
EMOTIONAL_STATUS is highly imbalanced (74.9%) | Imbalance |
BODILY_INJURY is highly imbalanced (69.3%) | Imbalance |
POSITION_IN_VEHICLE is highly imbalanced (52.9%) | Imbalance |
SAFETY_EQUIPMENT is highly imbalanced (60.4%) | Imbalance |
COMPLAINT is highly imbalanced (75.7%) | Imbalance |
VEHICLE_ID has 217425 (4.1%) missing values | Missing |
PERSON_AGE has 573413 (10.7%) missing values | Missing |
EJECTION has 2607425 (48.7%) missing values | Missing |
EMOTIONAL_STATUS has 2525502 (47.2%) missing values | Missing |
BODILY_INJURY has 2525459 (47.2%) missing values | Missing |
POSITION_IN_VEHICLE has 2607042 (48.7%) missing values | Missing |
SAFETY_EQUIPMENT has 2779911 (51.9%) missing values | Missing |
PED_LOCATION has 5267506 (98.4%) missing values | Missing |
PED_ACTION has 5267607 (98.4%) missing values | Missing |
COMPLAINT has 2525452 (47.2%) missing values | Missing |
PED_ROLE has 194889 (3.6%) missing values | Missing |
CONTRIBUTING_FACTOR_1 has 5268834 (98.4%) missing values | Missing |
CONTRIBUTING_FACTOR_2 has 5268945 (98.4%) missing values | Missing |
PERSON_SEX has 600519 (11.2%) missing values | Missing |
PERSON_AGE is highly skewed (γ1 = 71.62917362) | Skewed |
UNIQUE_ID has unique values | Unique |
PERSON_AGE has 547074 (10.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-07 03:32:16.744572 |
|---|---|
| Analysis finished | 2024-05-07 03:35:20.071068 |
| Duration | 3 minutes and 3.33 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
UNIQUE_ID
Real number (ℝ)
UNIQUE 
| Distinct | 5354259 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9035720.7 |
| Minimum | 10922 |
|---|---|
| Maximum | 12968392 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 40.8 MiB |
Quantile statistics
| Minimum | 10922 |
|---|---|
| 5-th percentile | 5802509.9 |
| Q1 | 6980392.5 |
| median | 9333883 |
| Q3 | 11383736 |
| 95-th percentile | 12674086 |
| Maximum | 12968392 |
| Range | 12957470 |
| Interquartile range (IQR) | 4403343 |
Descriptive statistics
| Standard deviation | 2618240.5 |
|---|---|
| Coefficient of variation (CV) | 0.28976554 |
| Kurtosis | -0.07563761 |
| Mean | 9035720.7 |
| Median Absolute Deviation (MAD) | 2206706 |
| Skewness | -0.48457155 |
| Sum | 4.8379589 × 1013 |
| Variance | 6.8551831 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10249006 | 1 | < 0.1% |
| 9412294 | 1 | < 0.1% |
| 7428923 | 1 | < 0.1% |
| 2236482 | 1 | < 0.1% |
| 7207179 | 1 | < 0.1% |
| 9988837 | 1 | < 0.1% |
| 9465489 | 1 | < 0.1% |
| 6107203 | 1 | < 0.1% |
| 10102117 | 1 | < 0.1% |
| 7337158 | 1 | < 0.1% |
| Other values (5354249) | 5354249 |
| Value | Count | Frequency (%) |
| 10922 | 1 | |
| 79660 | 1 | |
| 79953 | 1 | |
| 79954 | 1 | |
| 81004 | 1 | |
| 81073 | 1 | |
| 81886 | 1 | |
| 82012 | 1 | |
| 82146 | 1 | |
| 82227 | 1 |
| Value | Count | Frequency (%) |
| 12968392 | 1 | |
| 12968391 | 1 | |
| 12968390 | 1 | |
| 12968353 | 1 | |
| 12968352 | 1 | |
| 12968345 | 1 | |
| 12968344 | 1 | |
| 12968343 | 1 | |
| 12968342 | 1 | |
| 12968341 | 1 |
COLLISION_ID
Real number (ℝ)
| Distinct | 1456093 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3949022.2 |
| Minimum | 37 |
|---|---|
| Maximum | 4722272 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 40.8 MiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 3423239 |
| Q1 | 3677963 |
| median | 4002445 |
| Q3 | 4339124 |
| 95-th percentile | 4645051 |
| Maximum | 4722272 |
| Range | 4722235 |
| Interquartile range (IQR) | 661161 |
Descriptive statistics
| Standard deviation | 651086.15 |
|---|---|
| Coefficient of variation (CV) | 0.16487275 |
| Kurtosis | 17.977044 |
| Mean | 3949022.2 |
| Median Absolute Deviation (MAD) | 329916 |
| Skewness | -3.5019178 |
| Sum | 2.1144088 × 1013 |
| Variance | 4.2391318 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3963775 | 77 | < 0.1% |
| 4691158 | 71 | < 0.1% |
| 3591272 | 66 | < 0.1% |
| 3539636 | 65 | < 0.1% |
| 3504309 | 64 | < 0.1% |
| 3571716 | 62 | < 0.1% |
| 3904409 | 61 | < 0.1% |
| 3691734 | 61 | < 0.1% |
| 4143411 | 60 | < 0.1% |
| 3449201 | 60 | < 0.1% |
| Other values (1456083) | 5353612 |
| Value | Count | Frequency (%) |
| 37 | 1 | |
| 39 | 1 | |
| 40 | 1 | |
| 44 | 1 | |
| 52 | 1 | |
| 55 | 2 | |
| 78 | 1 | |
| 79 | 2 | |
| 104 | 1 | |
| 107 | 1 |
| Value | Count | Frequency (%) |
| 4722272 | 1 | < 0.1% |
| 4722270 | 7 | |
| 4722268 | 3 | |
| 4722265 | 4 | |
| 4722264 | 5 | |
| 4722263 | 3 | |
| 4722260 | 2 | < 0.1% |
| 4722259 | 3 | |
| 4722254 | 2 | < 0.1% |
| 4722253 | 2 | < 0.1% |
CRASH_DATE
Date
| Distinct | 4325 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 40.8 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2024-05-03 00:00:00 |
CRASH_TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 40.8 MiB |
| Minimum | 2024-05-06 00:00:00 |
|---|---|
| Maximum | 2024-05-06 23:59:00 |
PERSON_ID
Text
| Distinct | 5159436 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 19 |
| Missing (%) | < 0.1% |
| Memory size | 446.3 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 30.4029 |
| Min length | 1 |
Characters and Unicode
| Total characters | 162784425 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5159404 ? |
|---|---|
| Unique (%) | 96.4% |
Sample
| 1st row | 31aa2bc0-f545-444f-8cdb-f1cb5cf00b89 |
|---|---|
| 2nd row | 4629e500-a73e-48dc-b8fb-53124d124b80 |
| 3rd row | ae48c136-1383-45db-83f4-2a5eecfb7cff |
| 4th row | 2782525 |
| 5th row | e038e18f-40fb-4471-99cf-345eae36e064 |
| Value | Count | Frequency (%) |
| 1 | 142787 | 2.7% |
| 2 | 31734 | 0.6% |
| 3 | 11543 | 0.2% |
| 4 | 4672 | 0.1% |
| 5 | 2005 | < 0.1% |
| 6 | 923 | < 0.1% |
| 7 | 448 | < 0.1% |
| 8 | 235 | < 0.1% |
| 9 | 149 | < 0.1% |
| 10 | 91 | < 0.1% |
| Other values (5159426) | 5159653 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 17478992 | 10.7% |
| 4 | 13032417 | 8.0% |
| 9 | 9762024 | 6.0% |
| 8 | 9750714 | 6.0% |
| b | 9284397 | 5.7% |
| a | 9284146 | 5.7% |
| 1 | 9080464 | 5.6% |
| 2 | 8947375 | 5.5% |
| 3 | 8733405 | 5.4% |
| 7 | 8667294 | 5.3% |
| Other values (7) | 58763197 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 162784425 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 17478992 | 10.7% |
| 4 | 13032417 | 8.0% |
| 9 | 9762024 | 6.0% |
| 8 | 9750714 | 6.0% |
| b | 9284397 | 5.7% |
| a | 9284146 | 5.7% |
| 1 | 9080464 | 5.6% |
| 2 | 8947375 | 5.5% |
| 3 | 8733405 | 5.4% |
| 7 | 8667294 | 5.3% |
| Other values (7) | 58763197 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 162784425 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 17478992 | 10.7% |
| 4 | 13032417 | 8.0% |
| 9 | 9762024 | 6.0% |
| 8 | 9750714 | 6.0% |
| b | 9284397 | 5.7% |
| a | 9284146 | 5.7% |
| 1 | 9080464 | 5.6% |
| 2 | 8947375 | 5.5% |
| 3 | 8733405 | 5.4% |
| 7 | 8667294 | 5.3% |
| Other values (7) | 58763197 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 162784425 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 17478992 | 10.7% |
| 4 | 13032417 | 8.0% |
| 9 | 9762024 | 6.0% |
| 8 | 9750714 | 6.0% |
| b | 9284397 | 5.7% |
| a | 9284146 | 5.7% |
| 1 | 9080464 | 5.6% |
| 2 | 8947375 | 5.5% |
| 3 | 8733405 | 5.4% |
| 7 | 8667294 | 5.3% |
| Other values (7) | 58763197 |
PERSON_TYPE
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.3 MiB |
| Occupant | |
|---|---|
| Pedestrian | 126915 |
| Bicyclist | 67688 |
| Other Motorized | 9201 |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.0720781 |
| Min length | 8 |
Characters and Unicode
| Total characters | 43219997 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Occupant |
|---|---|
| 2nd row | Occupant |
| 3rd row | Occupant |
| 4th row | Occupant |
| 5th row | Occupant |
Common Values
| Value | Count | Frequency (%) |
| Occupant | 5150455 | |
| Pedestrian | 126915 | 2.4% |
| Bicyclist | 67688 | 1.3% |
| Other Motorized | 9201 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| occupant | 5150455 | |
| pedestrian | 126915 | 2.4% |
| bicyclist | 67688 | 1.3% |
| other | 9201 | 0.2% |
| motorized | 9201 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 10436286 | |
| t | 5363460 | |
| a | 5277370 | |
| n | 5277370 | |
| O | 5159656 | |
| u | 5150455 | |
| p | 5150455 | |
| e | 272232 | 0.6% |
| i | 271492 | 0.6% |
| s | 194603 | 0.5% |
| Other values (11) | 666618 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43219997 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 10436286 | |
| t | 5363460 | |
| a | 5277370 | |
| n | 5277370 | |
| O | 5159656 | |
| u | 5150455 | |
| p | 5150455 | |
| e | 272232 | 0.6% |
| i | 271492 | 0.6% |
| s | 194603 | 0.5% |
| Other values (11) | 666618 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43219997 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 10436286 | |
| t | 5363460 | |
| a | 5277370 | |
| n | 5277370 | |
| O | 5159656 | |
| u | 5150455 | |
| p | 5150455 | |
| e | 272232 | 0.6% |
| i | 271492 | 0.6% |
| s | 194603 | 0.5% |
| Other values (11) | 666618 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43219997 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 10436286 | |
| t | 5363460 | |
| a | 5277370 | |
| n | 5277370 | |
| O | 5159656 | |
| u | 5150455 | |
| p | 5150455 | |
| e | 272232 | 0.6% |
| i | 271492 | 0.6% |
| s | 194603 | 0.5% |
| Other values (11) | 666618 | 1.5% |
PERSON_INJURY
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 344.7 MiB |
| Unspecified | |
|---|---|
| Injured | |
| Killed | 3127 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.512691 |
| Min length | 6 |
Characters and Unicode
| Total characters | 56287670 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 4702746 | |
| Injured | 648386 | 12.1% |
| Killed | 3127 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unspecified | 4702746 | |
| injured | 648386 | 12.1% |
| killed | 3127 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 10057005 | |
| i | 9408619 | |
| d | 5354259 | |
| n | 5351132 | |
| U | 4702746 | |
| s | 4702746 | |
| p | 4702746 | |
| c | 4702746 | |
| f | 4702746 | |
| I | 648386 | 1.2% |
| Other values (5) | 1954539 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 56287670 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 10057005 | |
| i | 9408619 | |
| d | 5354259 | |
| n | 5351132 | |
| U | 4702746 | |
| s | 4702746 | |
| p | 4702746 | |
| c | 4702746 | |
| f | 4702746 | |
| I | 648386 | 1.2% |
| Other values (5) | 1954539 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 56287670 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 10057005 | |
| i | 9408619 | |
| d | 5354259 | |
| n | 5351132 | |
| U | 4702746 | |
| s | 4702746 | |
| p | 4702746 | |
| c | 4702746 | |
| f | 4702746 | |
| I | 648386 | 1.2% |
| Other values (5) | 1954539 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 56287670 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 10057005 | |
| i | 9408619 | |
| d | 5354259 | |
| n | 5351132 | |
| U | 4702746 | |
| s | 4702746 | |
| p | 4702746 | |
| c | 4702746 | |
| f | 4702746 | |
| I | 648386 | 1.2% |
| Other values (5) | 1954539 | 3.5% |
VEHICLE_ID
Real number (ℝ)
MISSING 
| Distinct | 2476590 |
|---|---|
| Distinct (%) | 48.2% |
| Missing | 217425 |
| Missing (%) | 4.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18521806 |
| Minimum | 123423 |
|---|---|
| Maximum | 20645072 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 40.8 MiB |
Quantile statistics
| Minimum | 123423 |
|---|---|
| 5-th percentile | 17033806 |
| Q1 | 17545317 |
| median | 18692171 |
| Q3 | 19728357 |
| 95-th percentile | 20475924 |
| Maximum | 20645072 |
| Range | 20521649 |
| Interquartile range (IQR) | 2183039.8 |
Descriptive statistics
| Standard deviation | 1560660.3 |
|---|---|
| Coefficient of variation (CV) | 0.084260696 |
| Kurtosis | 8.0441376 |
| Mean | 18521806 |
| Median Absolute Deviation (MAD) | 1099290 |
| Skewness | -1.8952034 |
| Sum | 9.5143445 × 1013 |
| Variance | 2.4356606 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18590796 | 71 | < 0.1% |
| 17075216 | 63 | < 0.1% |
| 17334601 | 63 | < 0.1% |
| 17364088 | 60 | < 0.1% |
| 18954743 | 58 | < 0.1% |
| 18968693 | 58 | < 0.1% |
| 17483298 | 58 | < 0.1% |
| 17826063 | 58 | < 0.1% |
| 19106096 | 57 | < 0.1% |
| 17521817 | 57 | < 0.1% |
| Other values (2476580) | 5136231 | |
| (Missing) | 217425 | 4.1% |
| Value | Count | Frequency (%) |
| 123423 | 1 | < 0.1% |
| 602947 | 2 | |
| 611686 | 1 | < 0.1% |
| 620307 | 1 | < 0.1% |
| 621082 | 2 | |
| 622848 | 3 | |
| 625915 | 1 | < 0.1% |
| 628019 | 1 | < 0.1% |
| 629935 | 1 | < 0.1% |
| 630993 | 3 |
| Value | Count | Frequency (%) |
| 20645072 | 1 | < 0.1% |
| 20645071 | 2 | |
| 20645048 | 1 | < 0.1% |
| 20645047 | 1 | < 0.1% |
| 20645040 | 2 | |
| 20645039 | 3 | |
| 20645038 | 1 | < 0.1% |
| 20645037 | 2 | |
| 20645036 | 2 | |
| 20645035 | 1 | < 0.1% |
PERSON_AGE
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 886 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 573413 |
| Missing (%) | 10.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.250145 |
| Minimum | -999 |
|---|---|
| Maximum | 9999 |
| Zeros | 547074 |
| Zeros (%) | 10.2% |
| Negative | 1167 |
| Negative (%) | < 0.1% |
| Memory size | 40.8 MiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 24 |
| median | 35 |
| Q3 | 50 |
| 95-th percentile | 68 |
| Maximum | 9999 |
| Range | 10998 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 114.32082 |
|---|---|
| Coefficient of variation (CV) | 3.0690033 |
| Kurtosis | 5755.5251 |
| Mean | 37.250145 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 71.629174 |
| Sum | 1.7808721 × 108 |
| Variance | 13069.25 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 547074 | 10.2% |
| 30 | 108808 | 2.0% |
| 29 | 108607 | 2.0% |
| 28 | 108204 | 2.0% |
| 27 | 107798 | 2.0% |
| 31 | 104954 | 2.0% |
| 26 | 104646 | 2.0% |
| 32 | 103558 | 1.9% |
| 25 | 100621 | 1.9% |
| 33 | 100488 | 1.9% |
| Other values (876) | 3286088 | |
| (Missing) | 573413 | 10.7% |
| Value | Count | Frequency (%) |
| -999 | 8 | |
| -997 | 2 | < 0.1% |
| -996 | 1 | < 0.1% |
| -992 | 2 | < 0.1% |
| -991 | 1 | < 0.1% |
| -990 | 3 | < 0.1% |
| -989 | 1 | < 0.1% |
| -987 | 1 | < 0.1% |
| -982 | 3 | < 0.1% |
| -980 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 415 | |
| 9262 | 1 | < 0.1% |
| 9232 | 1 | < 0.1% |
| 9211 | 1 | < 0.1% |
| 9191 | 1 | < 0.1% |
| 9151 | 1 | < 0.1% |
| 9122 | 1 | < 0.1% |
| 8041 | 1 | < 0.1% |
| 7301 | 2 | < 0.1% |
| 7275 | 2 | < 0.1% |
EJECTION
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2607425 |
| Missing (%) | 48.7% |
| Memory size | 337.3 MiB |
| Not Ejected | |
|---|---|
| Ejected | 24499 |
| Does Not Apply | 15891 |
| Partially Ejected | 10739 |
| Trapped | 1272 |
Length
| Max length | 17 |
|---|---|
| Median length | 11 |
| Mean length | 11.002497 |
| Min length | 7 |
Characters and Unicode
| Total characters | 30222033 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Ejected |
|---|---|
| 2nd row | Not Ejected |
| 3rd row | Not Ejected |
| 4th row | Not Ejected |
| 5th row | Not Ejected |
Common Values
| Value | Count | Frequency (%) |
| Not Ejected | 2693892 | |
| Ejected | 24499 | 0.5% |
| Does Not Apply | 15891 | 0.3% |
| Partially Ejected | 10739 | 0.2% |
| Trapped | 1272 | < 0.1% |
| Unknown | 541 | < 0.1% |
| (Missing) | 2607425 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ejected | 2729130 | |
| not | 2709783 | |
| does | 15891 | 0.3% |
| apply | 15891 | 0.3% |
| partially | 10739 | 0.2% |
| trapped | 1272 | < 0.1% |
| unknown | 541 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5475423 | |
| t | 5449652 | |
| 2736413 | ||
| d | 2730402 | |
| E | 2729130 | |
| j | 2729130 | |
| c | 2729130 | |
| o | 2726215 | |
| N | 2709783 | |
| l | 37369 | 0.1% |
| Other values (14) | 169386 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 30222033 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5475423 | |
| t | 5449652 | |
| 2736413 | ||
| d | 2730402 | |
| E | 2729130 | |
| j | 2729130 | |
| c | 2729130 | |
| o | 2726215 | |
| N | 2709783 | |
| l | 37369 | 0.1% |
| Other values (14) | 169386 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 30222033 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5475423 | |
| t | 5449652 | |
| 2736413 | ||
| d | 2730402 | |
| E | 2729130 | |
| j | 2729130 | |
| c | 2729130 | |
| o | 2726215 | |
| N | 2709783 | |
| l | 37369 | 0.1% |
| Other values (14) | 169386 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 30222033 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5475423 | |
| t | 5449652 | |
| 2736413 | ||
| d | 2730402 | |
| E | 2729130 | |
| j | 2729130 | |
| c | 2729130 | |
| o | 2726215 | |
| N | 2709783 | |
| l | 37369 | 0.1% |
| Other values (14) | 169386 | 0.6% |
EMOTIONAL_STATUS
Categorical
IMBALANCE  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2525502 |
| Missing (%) | 47.2% |
| Memory size | 343.3 MiB |
| Does Not Apply | |
|---|---|
| Conscious | |
| Unknown | 13540 |
| Shock | 13108 |
| Semiconscious | 2724 |
| Other values (3) | 6167 |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 13.11888 |
| Min length | 5 |
Characters and Unicode
| Total characters | 37110123 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Does Not Apply |
|---|---|
| 2nd row | Does Not Apply |
| 3rd row | Conscious |
| 4th row | Conscious |
| 5th row | Does Not Apply |
Common Values
| Value | Count | Frequency (%) |
| Does Not Apply | 2340767 | |
| Conscious | 452451 | 8.5% |
| Unknown | 13540 | 0.3% |
| Shock | 13108 | 0.2% |
| Semiconscious | 2724 | 0.1% |
| Unconscious | 2536 | < 0.1% |
| Apparent Death | 1847 | < 0.1% |
| Incoherent | 1784 | < 0.1% |
| (Missing) | 2525502 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| does | 2340767 | |
| not | 2340767 | |
| apply | 2340767 | |
| conscious | 452451 | 6.0% |
| unknown | 13540 | 0.2% |
| shock | 13108 | 0.2% |
| semiconscious | 2724 | < 0.1% |
| unconscious | 2536 | < 0.1% |
| apparent | 1847 | < 0.1% |
| death | 1847 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5625388 | |
| p | 4685228 | |
| 4683381 | ||
| s | 3256189 | |
| e | 2350753 | |
| t | 2346245 | |
| D | 2342614 | |
| A | 2342614 | |
| N | 2340767 | |
| l | 2340767 | |
| Other values (15) | 4796177 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37110123 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 5625388 | |
| p | 4685228 | |
| 4683381 | ||
| s | 3256189 | |
| e | 2350753 | |
| t | 2346245 | |
| D | 2342614 | |
| A | 2342614 | |
| N | 2340767 | |
| l | 2340767 | |
| Other values (15) | 4796177 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 37110123 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 5625388 | |
| p | 4685228 | |
| 4683381 | ||
| s | 3256189 | |
| e | 2350753 | |
| t | 2346245 | |
| D | 2342614 | |
| A | 2342614 | |
| N | 2340767 | |
| l | 2340767 | |
| Other values (15) | 4796177 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 37110123 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 5625388 | |
| p | 4685228 | |
| 4683381 | ||
| s | 3256189 | |
| e | 2350753 | |
| t | 2346245 | |
| D | 2342614 | |
| A | 2342614 | |
| N | 2340767 | |
| l | 2340767 | |
| Other values (15) | 4796177 |
BODILY_INJURY
Categorical
IMBALANCE  MISSING 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2525459 |
| Missing (%) | 47.2% |
| Memory size | 343.8 MiB |
| Does Not Apply | |
|---|---|
| Back | 76604 |
| Neck | 73655 |
| Knee-Lower Leg Foot | 69953 |
| Head | 63462 |
| Other values (9) | 173753 |
Length
| Max length | 20 |
|---|---|
| Median length | 14 |
| Mean length | 13.311595 |
| Min length | 3 |
Characters and Unicode
| Total characters | 37655839 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Does Not Apply |
|---|---|
| 2nd row | Does Not Apply |
| 3rd row | Back |
| 4th row | Shoulder - Upper Arm |
| 5th row | Does Not Apply |
Common Values
| Value | Count | Frequency (%) |
| Does Not Apply | 2371373 | |
| Back | 76604 | 1.4% |
| Neck | 73655 | 1.4% |
| Knee-Lower Leg Foot | 69953 | 1.3% |
| Head | 63462 | 1.2% |
| Entire Body | 37106 | 0.7% |
| Elbow-Lower-Arm-Hand | 31223 | 0.6% |
| Shoulder - Upper Arm | 31185 | 0.6% |
| Unknown | 19935 | 0.4% |
| Chest | 16808 | 0.3% |
| Other values (4) | 37496 | 0.7% |
| (Missing) | 2525459 |
Length
| Value | Count | Frequency (%) |
| does | 2371373 | |
| apply | 2371373 | |
| not | 2371373 | |
| leg | 86243 | 1.1% |
| back | 76604 | 1.0% |
| neck | 73655 | 0.9% |
| knee-lower | 69953 | 0.9% |
| foot | 69953 | 0.9% |
| head | 63462 | 0.8% |
| 39268 | 0.5% | |
| Other values (13) | 281312 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5111360 | |
| 5045769 | ||
| p | 4853986 | |
| e | 2997678 | |
| t | 2495240 | |
| N | 2445028 | |
| A | 2441864 | |
| l | 2441864 | |
| y | 2409354 | |
| s | 2396264 | |
| Other values (26) | 5017432 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37655839 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 5111360 | |
| 5045769 | ||
| p | 4853986 | |
| e | 2997678 | |
| t | 2495240 | |
| N | 2445028 | |
| A | 2441864 | |
| l | 2441864 | |
| y | 2409354 | |
| s | 2396264 | |
| Other values (26) | 5017432 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 37655839 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 5111360 | |
| 5045769 | ||
| p | 4853986 | |
| e | 2997678 | |
| t | 2495240 | |
| N | 2445028 | |
| A | 2441864 | |
| l | 2441864 | |
| y | 2409354 | |
| s | 2396264 | |
| Other values (26) | 5017432 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 37655839 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 5111360 | |
| 5045769 | ||
| p | 4853986 | |
| e | 2997678 | |
| t | 2495240 | |
| N | 2445028 | |
| A | 2441864 | |
| l | 2441864 | |
| y | 2409354 | |
| s | 2396264 | |
| Other values (26) | 5017432 |
POSITION_IN_VEHICLE
Categorical
IMBALANCE  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2607042 |
| Missing (%) | 48.7% |
| Memory size | 372.8 MiB |
| Driver | |
|---|---|
| Front passenger, if two or more persons, including the driver, are in the front seat | |
| Right rear passenger or motorcycle sidecar passenger | 137225 |
| Left rear passenger, or rear passenger on a bicycle, motorcycle, snowmobile | 128534 |
| Any person in the rear of a station wagon, pick-up truck, all passengers on a bus, etc | 74244 |
| Other values (6) | 150378 |
Length
| Max length | 86 |
|---|---|
| Median length | 6 |
| Mean length | 24.571 |
| Min length | 6 |
Characters and Unicode
| Total characters | 67501868 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Front passenger, if two or more persons, including the driver, are in the front seat |
|---|---|
| 2nd row | Right rear passenger or motorcycle sidecar passenger |
| 3rd row | Driver |
| 4th row | Driver |
| 5th row | Driver |
Common Values
| Value | Count | Frequency (%) |
| Driver | 1919815 | |
| Front passenger, if two or more persons, including the driver, are in the front seat | 337021 | 6.3% |
| Right rear passenger or motorcycle sidecar passenger | 137225 | 2.6% |
| Left rear passenger, or rear passenger on a bicycle, motorcycle, snowmobile | 128534 | 2.4% |
| Any person in the rear of a station wagon, pick-up truck, all passengers on a bus, etc | 74244 | 1.4% |
| Unknown | 63889 | 1.2% |
| Middle rear seat, or passenger lying across a seat | 41514 | 0.8% |
| Middle front seat, or passenger lying across a seat | 33717 | 0.6% |
| Riding/Hanging on Outside | 7114 | 0.1% |
| Does Not Apply | 3246 | 0.1% |
| (Missing) | 2607042 |
Length
| Value | Count | Frequency (%) |
| driver | 2256836 | |
| passenger | 943770 | 8.3% |
| the | 748286 | 6.6% |
| front | 707759 | 6.2% |
| or | 678011 | 6.0% |
| rear | 510051 | 4.5% |
| seat | 487483 | 4.3% |
| in | 411265 | 3.6% |
| a | 352253 | 3.1% |
| if | 337919 | 3.0% |
| Other values (38) | 3958224 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 9578018 | |
| 8644640 | ||
| e | 8077800 | |
| i | 4538986 | 6.7% |
| n | 4076233 | 6.0% |
| s | 3926498 | 5.8% |
| o | 3843287 | 5.7% |
| a | 3150716 | 4.7% |
| t | 3121199 | 4.6% |
| v | 2256836 | 3.3% |
| Other values (29) | 16287655 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 67501868 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 9578018 | |
| 8644640 | ||
| e | 8077800 | |
| i | 4538986 | 6.7% |
| n | 4076233 | 6.0% |
| s | 3926498 | 5.8% |
| o | 3843287 | 5.7% |
| a | 3150716 | 4.7% |
| t | 3121199 | 4.6% |
| v | 2256836 | 3.3% |
| Other values (29) | 16287655 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 67501868 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 9578018 | |
| 8644640 | ||
| e | 8077800 | |
| i | 4538986 | 6.7% |
| n | 4076233 | 6.0% |
| s | 3926498 | 5.8% |
| o | 3843287 | 5.7% |
| a | 3150716 | 4.7% |
| t | 3121199 | 4.6% |
| v | 2256836 | 3.3% |
| Other values (29) | 16287655 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 67501868 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 9578018 | |
| 8644640 | ||
| e | 8077800 | |
| i | 4538986 | 6.7% |
| n | 4076233 | 6.0% |
| s | 3926498 | 5.8% |
| o | 3843287 | 5.7% |
| a | 3150716 | 4.7% |
| t | 3121199 | 4.6% |
| v | 2256836 | 3.3% |
| Other values (29) | 16287655 |
SAFETY_EQUIPMENT
Categorical
IMBALANCE  MISSING 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2779911 |
| Missing (%) | 51.9% |
| Memory size | 346.1 MiB |
| Lap Belt & Harness | |
|---|---|
| Unknown | |
| Lap Belt | |
| Child Restraint Only | 44909 |
| Air Bag Deployed/Lap Belt/Harness | 18962 |
| Other values (12) | 70714 |
Length
| Max length | 40 |
|---|---|
| Median length | 18 |
| Mean length | 14.874828 |
| Min length | 1 |
Characters and Unicode
| Total characters | 38292984 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lap Belt & Harness |
|---|---|
| 2nd row | Lap Belt |
| 3rd row | Lap Belt & Harness |
| 4th row | Lap Belt & Harness |
| 5th row | Lap Belt & Harness |
Common Values
| Value | Count | Frequency (%) |
| Lap Belt & Harness | 1648825 | |
| Unknown | 429584 | 8.0% |
| Lap Belt | 361354 | 6.7% |
| Child Restraint Only | 44909 | 0.8% |
| Air Bag Deployed/Lap Belt/Harness | 18962 | 0.4% |
| Other | 14920 | 0.3% |
| Helmet (Motorcycle Only) | 13070 | 0.2% |
| Harness | 12181 | 0.2% |
| Helmet Only (In-Line Skater/Bicyclist) | 9984 | 0.2% |
| - | 7185 | 0.1% |
| Other values (7) | 13374 | 0.2% |
| (Missing) | 2779911 |
Length
| Value | Count | Frequency (%) |
| belt | 2013160 | |
| lap | 2010181 | |
| harness | 1661006 | |
| 1656010 | ||
| unknown | 429584 | 5.3% |
| only | 68581 | 0.8% |
| restraint | 45449 | 0.6% |
| child | 44909 | 0.6% |
| air | 28751 | 0.4% |
| bag | 28751 | 0.4% |
| Other values (12) | 129476 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5541510 | ||
| e | 3925711 | |
| a | 3799940 | |
| s | 3419574 | |
| n | 3109886 | 8.1% |
| l | 2227561 | 5.8% |
| t | 2207669 | 5.8% |
| B | 2074442 | 5.4% |
| p | 2061953 | 5.4% |
| L | 2045691 | 5.3% |
| Other values (27) | 7879047 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 38292984 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5541510 | ||
| e | 3925711 | |
| a | 3799940 | |
| s | 3419574 | |
| n | 3109886 | 8.1% |
| l | 2227561 | 5.8% |
| t | 2207669 | 5.8% |
| B | 2074442 | 5.4% |
| p | 2061953 | 5.4% |
| L | 2045691 | 5.3% |
| Other values (27) | 7879047 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 38292984 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5541510 | ||
| e | 3925711 | |
| a | 3799940 | |
| s | 3419574 | |
| n | 3109886 | 8.1% |
| l | 2227561 | 5.8% |
| t | 2207669 | 5.8% |
| B | 2074442 | 5.4% |
| p | 2061953 | 5.4% |
| L | 2045691 | 5.3% |
| Other values (27) | 7879047 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 38292984 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5541510 | ||
| e | 3925711 | |
| a | 3799940 | |
| s | 3419574 | |
| n | 3109886 | 8.1% |
| l | 2227561 | 5.8% |
| t | 2207669 | 5.8% |
| B | 2074442 | 5.4% |
| p | 2061953 | 5.4% |
| L | 2045691 | 5.3% |
| Other values (27) | 7879047 |
PED_LOCATION
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5267506 |
| Missing (%) | 98.4% |
| Memory size | 330.5 MiB |
| Pedestrian/Bicyclist/Other Pedestrian at Intersection | |
|---|---|
| Pedestrian/Bicyclist/Other Pedestrian Not at Intersection | |
| Does Not Apply | 3426 |
| Unknown | 2464 |
Length
| Max length | 57 |
|---|---|
| Median length | 53 |
| Mean length | 51.447108 |
| Min length | 7 |
Characters and Unicode
| Total characters | 4463191 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pedestrian/Bicyclist/Other Pedestrian at Intersection |
|---|---|
| 2nd row | Pedestrian/Bicyclist/Other Pedestrian at Intersection |
| 3rd row | Pedestrian/Bicyclist/Other Pedestrian Not at Intersection |
| 4th row | Pedestrian/Bicyclist/Other Pedestrian at Intersection |
| 5th row | Pedestrian/Bicyclist/Other Pedestrian at Intersection |
Common Values
| Value | Count | Frequency (%) |
| Pedestrian/Bicyclist/Other Pedestrian at Intersection | 52803 | 1.0% |
| Pedestrian/Bicyclist/Other Pedestrian Not at Intersection | 28060 | 0.5% |
| Does Not Apply | 3426 | 0.1% |
| Unknown | 2464 | < 0.1% |
| (Missing) | 5267506 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pedestrian/bicyclist/other | 80863 | |
| pedestrian | 80863 | |
| at | 80863 | |
| intersection | 80863 | |
| not | 31486 | 8.6% |
| does | 3426 | 0.9% |
| apply | 3426 | 0.9% |
| unknown | 2464 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 597527 | |
| e | 569467 | |
| i | 404315 | |
| n | 330844 | 7.4% |
| s | 326878 | 7.3% |
| r | 323452 | 7.2% |
| 277501 | 6.2% | |
| a | 242589 | 5.4% |
| c | 242589 | 5.4% |
| P | 161726 | 3.6% |
| Other values (16) | 986303 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4463191 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 597527 | |
| e | 569467 | |
| i | 404315 | |
| n | 330844 | 7.4% |
| s | 326878 | 7.3% |
| r | 323452 | 7.2% |
| 277501 | 6.2% | |
| a | 242589 | 5.4% |
| c | 242589 | 5.4% |
| P | 161726 | 3.6% |
| Other values (16) | 986303 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4463191 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 597527 | |
| e | 569467 | |
| i | 404315 | |
| n | 330844 | 7.4% |
| s | 326878 | 7.3% |
| r | 323452 | 7.2% |
| 277501 | 6.2% | |
| a | 242589 | 5.4% |
| c | 242589 | 5.4% |
| P | 161726 | 3.6% |
| Other values (16) | 986303 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4463191 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 597527 | |
| e | 569467 | |
| i | 404315 | |
| n | 330844 | 7.4% |
| s | 326878 | 7.3% |
| r | 323452 | 7.2% |
| 277501 | 6.2% | |
| a | 242589 | 5.4% |
| c | 242589 | 5.4% |
| P | 161726 | 3.6% |
| Other values (16) | 986303 |
PED_ACTION
Categorical
MISSING 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5267607 |
| Missing (%) | 98.4% |
| Memory size | 328.2 MiB |
| Crossing With Signal | |
|---|---|
| Crossing, No Signal, or Crosswalk | |
| Crossing, No Signal, Marked Crosswalk | |
| Other Actions in Roadway | |
| Crossing Against Signal | |
| Other values (11) |
Length
| Max length | 47 |
|---|---|
| Median length | 44 |
| Mean length | 24.471068 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2120467 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Crossing With Signal |
|---|---|
| 2nd row | Crossing With Signal |
| 3rd row | Crossing, No Signal, or Crosswalk |
| 4th row | Crossing With Signal |
| 5th row | Crossing With Signal |
Common Values
| Value | Count | Frequency (%) |
| Crossing With Signal | 32525 | 0.6% |
| Crossing, No Signal, or Crosswalk | 14729 | 0.3% |
| Crossing, No Signal, Marked Crosswalk | 7424 | 0.1% |
| Other Actions in Roadway | 6696 | 0.1% |
| Crossing Against Signal | 6058 | 0.1% |
| Unknown | 4142 | 0.1% |
| Not in Roadway | 4033 | 0.1% |
| Does Not Apply | 3852 | 0.1% |
| Emerging from in Front of/Behind Parked Vehicle | 2765 | 0.1% |
| Working in Roadway | 1321 | < 0.1% |
| Other values (6) | 3107 | 0.1% |
| (Missing) | 5267607 |
Length
| Value | Count | Frequency (%) |
| crossing | 60736 | |
| signal | 60736 | |
| with | 33390 | |
| crosswalk | 22153 | 6.9% |
| no | 22153 | 6.9% |
| in | 15291 | 4.8% |
| or | 14729 | 4.6% |
| roadway | 12526 | 3.9% |
| other | 7887 | 2.5% |
| not | 7885 | 2.5% |
| Other values (29) | 63454 |
Most occurring characters
| Value | Count | Frequency (%) |
| 234288 | ||
| i | 201925 | 9.5% |
| s | 184150 | 8.7% |
| n | 180071 | 8.5% |
| o | 168884 | 8.0% |
| g | 141442 | 6.7% |
| a | 129809 | 6.1% |
| r | 126969 | 6.0% |
| l | 94735 | 4.5% |
| C | 83108 | 3.9% |
| Other values (31) | 575086 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2120467 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 234288 | ||
| i | 201925 | 9.5% |
| s | 184150 | 8.7% |
| n | 180071 | 8.5% |
| o | 168884 | 8.0% |
| g | 141442 | 6.7% |
| a | 129809 | 6.1% |
| r | 126969 | 6.0% |
| l | 94735 | 4.5% |
| C | 83108 | 3.9% |
| Other values (31) | 575086 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2120467 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 234288 | ||
| i | 201925 | 9.5% |
| s | 184150 | 8.7% |
| n | 180071 | 8.5% |
| o | 168884 | 8.0% |
| g | 141442 | 6.7% |
| a | 129809 | 6.1% |
| r | 126969 | 6.0% |
| l | 94735 | 4.5% |
| C | 83108 | 3.9% |
| Other values (31) | 575086 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2120467 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 234288 | ||
| i | 201925 | 9.5% |
| s | 184150 | 8.7% |
| n | 180071 | 8.5% |
| o | 168884 | 8.0% |
| g | 141442 | 6.7% |
| a | 129809 | 6.1% |
| r | 126969 | 6.0% |
| l | 94735 | 4.5% |
| C | 83108 | 3.9% |
| Other values (31) | 575086 |
COMPLAINT
Categorical
IMBALANCE  MISSING 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2525452 |
| Missing (%) | 47.2% |
| Memory size | 348.2 MiB |
| Does Not Apply | |
|---|---|
| Complaint of Pain or Nausea | 201250 |
| Complaint of Pain | 88497 |
| None Visible | 46571 |
| Minor Bleeding | 24567 |
| Other values (16) | 95753 |
Length
| Max length | 34 |
|---|---|
| Median length | 14 |
| Mean length | 14.921692 |
| Min length | 7 |
Characters and Unicode
| Total characters | 42210588 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Does Not Apply |
|---|---|
| 2nd row | Does Not Apply |
| 3rd row | Complaint of Pain or Nausea |
| 4th row | None Visible |
| 5th row | Does Not Apply |
Common Values
| Value | Count | Frequency (%) |
| Does Not Apply | 2372169 | |
| Complaint of Pain or Nausea | 201250 | 3.8% |
| Complaint of Pain | 88497 | 1.7% |
| None Visible | 46571 | 0.9% |
| Minor Bleeding | 24567 | 0.5% |
| Contusion - Bruise | 19260 | 0.4% |
| Unknown | 19049 | 0.4% |
| Whiplash | 18560 | 0.3% |
| Abrasion | 14033 | 0.3% |
| Internal | 7395 | 0.1% |
| Other values (11) | 17456 | 0.3% |
| (Missing) | 2525452 |
Length
| Value | Count | Frequency (%) |
| does | 2372169 | |
| not | 2372169 | |
| apply | 2372169 | |
| complaint | 289747 | 3.3% |
| of | 289747 | 3.3% |
| pain | 289747 | 3.3% |
| or | 201250 | 2.3% |
| nausea | 201250 | 2.3% |
| none | 46571 | 0.5% |
| visible | 46571 | 0.5% |
| Other values (21) | 216030 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5868613 | ||
| o | 5697405 | |
| p | 5052798 | |
| e | 2775795 | 6.6% |
| l | 2769320 | 6.6% |
| t | 2716250 | 6.4% |
| s | 2714180 | 6.4% |
| N | 2619990 | 6.2% |
| A | 2386355 | 5.7% |
| D | 2385200 | 5.7% |
| Other values (29) | 7224682 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42210588 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5868613 | ||
| o | 5697405 | |
| p | 5052798 | |
| e | 2775795 | 6.6% |
| l | 2769320 | 6.6% |
| t | 2716250 | 6.4% |
| s | 2714180 | 6.4% |
| N | 2619990 | 6.2% |
| A | 2386355 | 5.7% |
| D | 2385200 | 5.7% |
| Other values (29) | 7224682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42210588 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5868613 | ||
| o | 5697405 | |
| p | 5052798 | |
| e | 2775795 | 6.6% |
| l | 2769320 | 6.6% |
| t | 2716250 | 6.4% |
| s | 2714180 | 6.4% |
| N | 2619990 | 6.2% |
| A | 2386355 | 5.7% |
| D | 2385200 | 5.7% |
| Other values (29) | 7224682 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42210588 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5868613 | ||
| o | 5697405 | |
| p | 5052798 | |
| e | 2775795 | 6.6% |
| l | 2769320 | 6.6% |
| t | 2716250 | 6.4% |
| s | 2714180 | 6.4% |
| N | 2619990 | 6.2% |
| A | 2386355 | 5.7% |
| D | 2385200 | 5.7% |
| Other values (29) | 7224682 |
PED_ROLE
Categorical
MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 194889 |
| Missing (%) | 3.6% |
| Memory size | 333.0 MiB |
| Registrant | |
|---|---|
| Driver | |
| Passenger | |
| Pedestrian | 85171 |
| Witness | 72450 |
| Other values (5) | 39480 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 8.2661449 |
| Min length | 5 |
Characters and Unicode
| Total characters | 42648100 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Registrant |
|---|---|
| 2nd row | Passenger |
| 3rd row | Registrant |
| 4th row | Notified Person |
| 5th row | Passenger |
Common Values
| Value | Count | Frequency (%) |
| Registrant | 2220389 | |
| Driver | 1964972 | |
| Passenger | 776908 | 14.5% |
| Pedestrian | 85171 | 1.6% |
| Witness | 72450 | 1.4% |
| Owner | 26682 | 0.5% |
| Notified Person | 8344 | 0.2% |
| Policy Holder | 2415 | < 0.1% |
| Other | 1685 | < 0.1% |
| In-Line Skater | 354 | < 0.1% |
| (Missing) | 194889 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| registrant | 2220389 | |
| driver | 1964972 | |
| passenger | 776908 | 15.0% |
| pedestrian | 85171 | 1.6% |
| witness | 72450 | 1.4% |
| owner | 26682 | 0.5% |
| notified | 8344 | 0.2% |
| person | 8344 | 0.2% |
| policy | 2415 | < 0.1% |
| holder | 2415 | < 0.1% |
| Other values (3) | 2393 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7051892 | |
| e | 6030147 | |
| t | 4608782 | |
| i | 4362439 | |
| s | 4012620 | |
| n | 3190652 | |
| a | 3082822 | |
| g | 2997297 | |
| R | 2220389 | 5.2% |
| D | 1964972 | 4.6% |
| Other values (20) | 3126088 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42648100 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 7051892 | |
| e | 6030147 | |
| t | 4608782 | |
| i | 4362439 | |
| s | 4012620 | |
| n | 3190652 | |
| a | 3082822 | |
| g | 2997297 | |
| R | 2220389 | 5.2% |
| D | 1964972 | 4.6% |
| Other values (20) | 3126088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42648100 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 7051892 | |
| e | 6030147 | |
| t | 4608782 | |
| i | 4362439 | |
| s | 4012620 | |
| n | 3190652 | |
| a | 3082822 | |
| g | 2997297 | |
| R | 2220389 | 5.2% |
| D | 1964972 | 4.6% |
| Other values (20) | 3126088 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42648100 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 7051892 | |
| e | 6030147 | |
| t | 4608782 | |
| i | 4362439 | |
| s | 4012620 | |
| n | 3190652 | |
| a | 3082822 | |
| g | 2997297 | |
| R | 2220389 | 5.2% |
| D | 1964972 | 4.6% |
| Other values (20) | 3126088 |
MISSING 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5268834 |
| Missing (%) | 98.4% |
| Memory size | 167.0 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 19.60295 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1674582 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 59729 | |
| pedestrian/bicyclist/other | 13527 | 10.2% |
| pedestrian | 13527 | 10.2% |
| error/confusion | 13527 | 10.2% |
| driver | 3177 | 2.4% |
| inattention/distraction | 3099 | 2.3% |
| to | 2321 | 1.8% |
| failure | 2263 | 1.7% |
| right-of-way | 2241 | 1.7% |
| yield | 2241 | 1.7% |
| Other values (90) | 16503 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 215196 | |
| e | 212540 | |
| n | 134022 | 8.0% |
| s | 122238 | 7.3% |
| r | 103575 | 6.2% |
| c | 95265 | 5.7% |
| d | 94213 | 5.6% |
| t | 80850 | 4.8% |
| f | 78813 | 4.7% |
| p | 60884 | 3.6% |
| Other values (42) | 476986 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1674582 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 215196 | |
| e | 212540 | |
| n | 134022 | 8.0% |
| s | 122238 | 7.3% |
| r | 103575 | 6.2% |
| c | 95265 | 5.7% |
| d | 94213 | 5.6% |
| t | 80850 | 4.8% |
| f | 78813 | 4.7% |
| p | 60884 | 3.6% |
| Other values (42) | 476986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1674582 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 215196 | |
| e | 212540 | |
| n | 134022 | 8.0% |
| s | 122238 | 7.3% |
| r | 103575 | 6.2% |
| c | 95265 | 5.7% |
| d | 94213 | 5.6% |
| t | 80850 | 4.8% |
| f | 78813 | 4.7% |
| p | 60884 | 3.6% |
| Other values (42) | 476986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1674582 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 215196 | |
| e | 212540 | |
| n | 134022 | 8.0% |
| s | 122238 | 7.3% |
| r | 103575 | 6.2% |
| c | 95265 | 5.7% |
| d | 94213 | 5.6% |
| t | 80850 | 4.8% |
| f | 78813 | 4.7% |
| p | 60884 | 3.6% |
| Other values (42) | 476986 |
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5268945 |
| Missing (%) | 98.4% |
| Memory size | 166.6 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.841515 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1180875 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 75142 | |
| pedestrian/bicyclist/other | 3738 | 3.6% |
| pedestrian | 3738 | 3.6% |
| error/confusion | 3738 | 3.6% |
| driver | 1343 | 1.3% |
| inattention/distraction | 1221 | 1.2% |
| to | 1213 | 1.2% |
| failure | 1181 | 1.1% |
| yield | 1158 | 1.1% |
| right-of-way | 1158 | 1.1% |
| Other values (84) | 9925 | 9.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 183294 | |
| e | 182730 | |
| n | 99331 | |
| s | 94694 | |
| d | 87313 | |
| c | 86861 | |
| f | 82347 | |
| p | 76084 | |
| U | 75601 | |
| r | 34647 | 2.9% |
| Other values (42) | 177973 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1180875 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 183294 | |
| e | 182730 | |
| n | 99331 | |
| s | 94694 | |
| d | 87313 | |
| c | 86861 | |
| f | 82347 | |
| p | 76084 | |
| U | 75601 | |
| r | 34647 | 2.9% |
| Other values (42) | 177973 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1180875 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 183294 | |
| e | 182730 | |
| n | 99331 | |
| s | 94694 | |
| d | 87313 | |
| c | 86861 | |
| f | 82347 | |
| p | 76084 | |
| U | 75601 | |
| r | 34647 | 2.9% |
| Other values (42) | 177973 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1180875 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 183294 | |
| e | 182730 | |
| n | 99331 | |
| s | 94694 | |
| d | 87313 | |
| c | 86861 | |
| f | 82347 | |
| p | 76084 | |
| U | 75601 | |
| r | 34647 | 2.9% |
| Other values (42) | 177973 |
PERSON_SEX
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 600519 |
| Missing (%) | 11.2% |
| Memory size | 299.6 MiB |
| M | |
|---|---|
| F | |
| U |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4753740 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | U |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 2878999 | |
| F | 1447674 | |
| U | 427067 | 8.0% |
| (Missing) | 600519 | 11.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 2878999 | |
| f | 1447674 | |
| u | 427067 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 2878999 | |
| F | 1447674 | |
| U | 427067 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4753740 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 2878999 | |
| F | 1447674 | |
| U | 427067 | 9.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4753740 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 2878999 | |
| F | 1447674 | |
| U | 427067 | 9.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4753740 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 2878999 | |
| F | 1447674 | |
| U | 427067 | 9.0% |